ACL - 08 : HLT 46 th Annual Meeting of the Association for Computational Linguistics : Human Language Technologies
نویسندگان
چکیده
This paper studies the impact of written language variations and the way it affects the capitalization task over time. A discriminative approach, based on maximum entropy models, is proposed to perform capitalization, taking the language changes into consideration. The proposed method makes it possible to use large corpora for training. The evaluation is performed over newspaper corpora using different testing periods. The achieved results reveal a strong relation between the capitalization performance and the elapsed time between the training and testing data periods.
منابع مشابه
ACL - 08 : HLT 46 th Annual Meeting of the Association for Computational Linguistics : Human Language Technologies
متن کامل
ACL HLT 2011 The 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies
متن کامل
NAACL HLT 2009 Human Language Technologies : The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics
متن کامل
ISBN : 978 - 1 - 61738 - 808 - 8 48 th Annual Meeting of the Association for Computational Linguistics 2010 ( ACL 2010 ) Uppsala , Sweden 11 - 16 July 2010
متن کامل
Creating Local Coherence: An Empirical Assessment
Two of the mechanisms for creating natural transitions between adjacent sentences in a text, resulting in local coherence, involve discourse relations and switches of focus of attention between discourse entities. These two aspects of local coherence have been traditionally discussed and studied separately. But some empirical studies have given strong evidence for the necessity of understanding...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008